Submission to Mirex Ams Task 2011 – Sparse Coding Similarity Learning Method
نویسندگان
چکیده
This paper describes our submissions to the MIREX 2011 audio music similarity and retrieval task. The proposed method is based on a machine learning technique – sparse coding (SC). The music similarity is not directly obtained from computed distance measures on audio contents, instead, we predict a higher level similarity scores to match the listener’s subjective perceptions based on these distance measures and our pre-trained models. At the training stage, we will record the mapping of computed distance measures and the associated high level similarity scores which is estimated from human (expert) tags. Using the sparse coding techniques, this mapping is recorded using two jointly learned dictionaries that respectively store the representative computed distance and the high level similarity score patterns. Both the computed distance measures and the similarity scores can be represented as a sparse linear combination of the elements in the corresponding dictionary. At the testing stage, we will find the ratios that each element in the dictionary contributes to the newly computed distance measures, and then use this ratio to predict the corresponding similarity scores. 1. ALGORITHM OVERVIEW We illustrate the flowchart of our algorithm in Figure 1. At the training stage, we first take the expert labelled tags of the training audio to estimate multifaceted similarity scores, such as, general-similarity, style-similarity, moodsimilarity, etc. Then, for each song, we extract a set of computable acoustic features. Using these features, we compute multiple distance measures for each song-pair. Next, we jointly learn two dictionaries for the representative distance measure patterns and the associated similarity score patterns. Given any two testing songs in the testing stage, we will compute its distance measures to estimate the similarity scores. This document is licensed under the Creative Commons Attribution-Noncommercial-Share Alike 3.0 License. http://creativecommons.org/licenses/by-nc-sa/3.0/ c © 2010 The Authors. Acoustic
منابع مشابه
Mirex 2011: Automatic Audio Tag Classification via Sparse Coding
This extended abstract details our submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2011 for the audio tag classification task. First of all, we extract a fixed-length feature vector (composed of some timbral as well as modulation spectrum features) from each song clip. Then, by using l-reconstruction to represent each test song clip as a linear combination of all train...
متن کاملMirex 2011 Ams - Audio Similarity via Metric Learning
Our submissions (ML1, ML2, ML3) to the Audio Music Similarity (AMS) task are based upon learning an optimal distance metric over vector quantized MFCC histograms. ML1 is optimized to predict similarity derived from a collaborative filter; ML2 is optimized to predict genre similarity; ML3 is an unsupervised baseline which uses a native distance metric. This abstract details the system architectu...
متن کاملMirex 2011 - Ams Task: Multi Descriptor Based Algorithm
A method for the estimation of music similarity based on a combination of timbre, rhythm and tonal content of the songs, is presented in this report. The rhythm is estimated by evaluating the frequency distribution of the tempo pattern of the songs, while the tonal content is estimated by the calculation of an averaged chroma profile of the song. The algorithm is proposed to be submitted to the...
متن کاملMirex 2011 - Ams Task: Mfcc/variogram Based Algorithm
A method for the estimation of music similarity based on the use of the standardized variogram as clustering algorithm for Mel Frequency Cepstral Coefficients, is detailed in this report. The standardized variogram is used for the compression of the information of MFCCs. The algorithm is submitted to the Audio Music Similarity task of MIREX 2011, in occasion of the 12th ISMIR Conference.
متن کاملText Information Retrieval Approach to Music Information Retrieval
This MIREX submission for symbolic music similarity task adopts textual information retrieval methodology in the process of music information retrieval. The main contribution of this approach is to utilize well established term weighting methods for text retrieval and check their suitability for music data. We use a simple feature extraction method, so that the performance of an algorithm depen...
متن کامل